Picture for Yitong Wang

Yitong Wang

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Add code
Jan 05, 2026
Viaarxiv icon

DreamOmni3: Scribble-based Editing and Generation

Add code
Dec 27, 2025
Viaarxiv icon

MMSRARec: Summarization and Retrieval Augumented Sequential Recommendation Based on Multimodal Large Language Model

Add code
Dec 24, 2025
Figure 1 for MMSRARec: Summarization and Retrieval Augumented Sequential Recommendation Based on Multimodal Large Language Model
Figure 2 for MMSRARec: Summarization and Retrieval Augumented Sequential Recommendation Based on Multimodal Large Language Model
Figure 3 for MMSRARec: Summarization and Retrieval Augumented Sequential Recommendation Based on Multimodal Large Language Model
Figure 4 for MMSRARec: Summarization and Retrieval Augumented Sequential Recommendation Based on Multimodal Large Language Model
Viaarxiv icon

Animate Any Character in Any World

Add code
Dec 18, 2025
Viaarxiv icon

Why Did Apple Fall To The Ground: Evaluating Curiosity In Large Language Model

Add code
Oct 23, 2025
Viaarxiv icon

Real, Fake, or Manipulated? Detecting Machine-Influenced Text

Add code
Sep 18, 2025
Viaarxiv icon

OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning

Add code
Aug 28, 2025
Figure 1 for OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning
Figure 2 for OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning
Figure 3 for OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning
Figure 4 for OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning
Viaarxiv icon

DreamVE: Unified Instruction-based Image and Video Editing

Add code
Aug 08, 2025
Viaarxiv icon

DreamLight: Towards Harmonious and Consistent Image Relighting

Add code
Jun 17, 2025
Viaarxiv icon

Private Transformer Inference in MLaaS: A Survey

Add code
May 15, 2025
Viaarxiv icon